首页> 外文OA文献 >A text recognition and retrieval system for e-business image management
【2h】

A text recognition and retrieval system for e-business image management

机译:一种用于电子商务图像管理的文本识别和检索系统

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The on-going growth of e-business has resulted in companies having to manage an ever increasing number of product, packaging and promotional images. Systems for indexing and retrieving such images are required in order to ensure image libraries can be managed and fully exploited as valuable business resources. In this paper, we explore the power of text recognition for e-business image management and propose an innovative system based on photo OCR. Photo OCR has been actively studied for scene text recognition but has not been exploited for e-business digital image management. Besides the well known difficulties in scene text recognition such as various size, location, orientation in text and cluttered background, e-business images typically feature text with extremely diverse fonts, and the characters are often artistically modified in shape, colour and arrangement. To address these challenges, our system takes advantage of the combinatorial power of deep neural networks and MSER processing. The cosine distance and n-gram vectors are used during retrieval for matching detected text to queries to provide tolerance to the inevitable transcription errors in text recognition. To evaluate our proposed system, we prepared a novel dataset designed specifically to reflect the challenges associated with text in e-business images. We compared our system with two other approaches for scene text recognition, and the results show our system outperforms other state-ofthe-art on the new challenging dataset. Our system demonstrates that recognizing text embedded in images can be hugely beneficial for digital asset management.
机译:电子商务的持续增长导致公司不得不管理越来越多的产品,包装和促销图像。需要用于索引和检索此类图像的系统,以确保可以管理图像库并将其完全用作有价值的业务资源。在本文中,我们探索了文本识别在电子商务图像管理中的强大功能,并提出了一种基于照片OCR的创新系统。 Photo OCR已被积极研究用于场景文本识别,但尚未被用于电子商务数字图像管理。除了场景文本识别中众所周知的困难(例如,大小,位置,文本方向和混乱的背景)之外,电子商务图像通常还具有具有极其多样化的字体的文本,并且通常会对字符进行形状,颜色和排列的艺术修改。为了应对这些挑战,我们的系统利用了深度神经网络和MSER处理的组合功能。余弦距离和n元语法向量在检索过程中用于将检测到的文本与查询匹配,以容忍文本识别中不可避免的转录错误。为了评估我们提出的系统,我们准备了一个新颖的数据集,专门设计来反映电子商务图像中与文本相关的挑战。我们将我们的系统与其他两种用于场景文本识别的方法进行了比较,结果表明,在新的具有挑战性的数据集上,我们的系统优于其他最新技术。我们的系统表明,识别图像中嵌入的文本可能对数字资产管理非常有益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号